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© IMAGE SIGNAL CONVERTER. 



© A digitally inputted image signal (SD signal) is converted into a high resolution digital image signal (HD 
^ signal) by using estimated values. The specific pixels of an object to be estimated are classified according to the 
^ one-, two-, or three-dimension level distribution of plural reference pixels of SD signal located near the specific 
00 pixels. The estimated values of the specific pixels are generated by a linear combination of the values of the 
^ reference values and the estimation coefficients determined in advance by learning. During the learning, a known 

HD signal and the SD signal formed from it are used to determine the estimation coefficients so as to minimize 
tf> the sum of squares of the errors between the true values and the values estimated by the linear combination of 
^ the values of the surrounding SD signal reference pixels and the estimated coefficients. Not necessarily limited 

to the estimated coefficients, it may be possible to use. as the estimated values, representatives determined for 
O every class in correspondence with the class of the inputted SD signal. The reference value of a block and the 

value normalized by the dynamic range DR are an example as the representatives. 
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Technical Field 

The present invention relates to an image signal converting apparatus suitable for so-called up- 
conversion where an image signal with a standard resolution is converted into an image signal with a high 
resolution. 



Background Art 

For example, in the case of TV signals, in addition to standard resolution (or standard definition SD) TV 

w signals, high definition (or high definition HD) TV signals have been experimentally used for broadcasting. 
TV receivers that can receive HD signals have been practically used. In other words, the HD TV system is 
going to coexist with the SD TV system. 

When the SD TV system and the HD TV system coexist, a signal converting apparatus for allowing an 
HD TV monitor to reproduce SD TV video signals will be required. Although various standards for the SD 

75 signals and the HD signals have been proposed, now assume an NTSC system for SD signals (number of 
scanning lines = 525, number of fields = 60, aspect ratio = 4:3) and an HDTV system for HD signals 
(number of scanning lines = 1125, number of fields = 60, aspect ratio = 16 : 9). The resolution of the HD 
signals is four times higher than the resolution of the SD signals. Thus, the converting apparatus should 
perform an up-converting process that increases the number of pixels of the input SD signal four times so 

20 as to obtain the resolution of an HD signal. 

Conventionally, the signal converting apparatus uses interpolating filters. Fig. 1 shows a block diagram 
of a conventional signal up-converting apparatus. An SD signal is supplied from an input terminal 1. The SD 
signal is sent to a horizontal interpolating filter 2 that increases the number of pixels of the SD signal twice 
in horizontal direction. The output of the horizontal interpolating filter 2 is sent to a vertical interpolating filter 

25 3 that increases the number of pixels of the output signal of the horizontal interpolating filter 2 twice in 
vertical direction. The output of the vertical interpolating filter 3 is an HD signal. The HD signal is sent to an 
output terminal 4. In the conventional signal converting apparatus, the image is up-converted with such 
filters. For example, 0 data is inserted into an interpolating point so that the sampling frequency of the SD 
signal accords with the sampling frequency of the HD signal. An interpolated value corresponding to the 

30 interpolating point is formed by each filter. 

Fig. 2 shows a construction of each of the interpolating filters. An SD signal is supplied to an input 
terminal 5. The SD signal is sent to multipliers that multiply the SD signal by filter coefficients a n ,a n -,, ... a 0 . 
Each of outputs of the multipliers is sent to a register with a unit delay amount T. The output of each of the 
multipliers and the output of each of the registers T are added. An interpolated output is sent to an output 

35 terminal 6. In the horizontal interpolating filter 2, the unit delay amount T accords with a sample period. In 
the vertical interpolating filter 3, the unit delay amount T accords with a line period. 

In the conventional image signal converting apparatus, since the SD signal is up-converted into the HD 
signal with the filters, the output HD signal is generated by processing the input SD signal. Thus : the 
resolution of the output HD signal is not higher than the resolution of the input SD signal. 

40 An object of the present invention is to provide an image signal converting apparatus that can 
compensate the resolution of the output signal. 

Another object of the present invention is to provide an image signal converting apparatus that 
categorizes the input signal as classes corresponding to local features of the input image signal so as to 
improve conversion accuracy. 

45 

Disclosure of Invention 



A first aspect of the present invention is a signal converting apparatus for converting a first digital image 
signal into a second digital image signal having higher resolution than the first digital image signal, 

so comprising a class categorizing means for categorizing a considered pixel of the second digital image 
signal as a class corresponding to a pattern of level distribution of a plurality of reference pixels of the first 
digital image signal, the reference pixels of the first digital image signal being disposed spatially and/or 
chronologically adjacent to the considered pixel, a memory means for storing predicted coefficients for each 
class and for outputting the predicted coefficients corresponding to class information received from the 

55 class categorizing means, and a predicted value generating means for calculating linear combination of the 
values of the pixels of ihe first digital image signal spalially and/or chronologically adjacent to the 
considered pixel so as to generate a predicted value of the considered pixel. The predicted coefficients are 
pre-learnt and pie-determined. 
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A second aspect of the present invention is the signal converting apparatus of first aspect further 
comprising a learning device, comprising a means for forming the first digital image signal with the second 
digital image signal, a class categorizing means for categorizing a considered pixel of the second digital 
image signal as a class corresponding to a pattern of level distribution of a plurality of reference pixels of 
the first digital image signal, the reference pixels of the first digital image signal being disposed spatially 
and/or chronologically adjacent to the considered pixel, and a means for determining predicted coefficients 
for each class so that the sum of square of an error between the true value of the considered pixel and the 
predicted value becomes minimum when the predicted value of the considered pixel is generated by linear 
combination of a plurality of pixels of the first digital image signal and the predicted coefficients. 

As a third aspect of the present invention, instead of predicted coefficients, representative values for 
each class are used as a predicted value of the second digital image signal. 

As a fourth aspect of the present invention, a class categorizing process is performed corresponding to 
the pattern of the level distribution of one-dimensional, two-dimensional, or three-dimensional pixel arrange- 
ment of the first digital image signal. When the pattern of the level distribution is obtained, the number of 
bits is compressed by compressing and encoding technique. 

Brief Description of Drawings 

Fig. 1 is a block diagram showing a conventional image signal converting apparatus; Fig. 2 is a block 
diagram showing an interpolating filter of Fig. 1; Fig. 3 is a block diagram showing an image signal 
converting apparatus according to a first embodiment of the present invention; Fig. 4 is a schematic 
diagram showing a two-dimensional arrangement of pixels of an SD signal and an HD signal; Fig. 5 is a 
schematic diagram showing a one-dimensional arrangement of pixels of an SD signal and an HD signal; Fig. 
6 is a graph for explaining a change of a signal level in ADRC; Fig. 7 is a schematic diagram for explaining 
quantizing characteristics in ADRC; Fig. 8 is a block diagram showing an encoding circuit corresponding to 
one-bit ADRC technique; Fig. 9 is a table showing SD pixels used in predicting values of a plurality of HD 
pixels; Fig. 10 is a flow chart for explaining a process of an embodiment of the present invention; Fig. 1 1 is 
a block diagram schematically showing a construction of a circuit block for a learning process for 
determining predicted coefficients; Fig. 12 is a flow chart for explaining a learning process; Fig. 13 is a 
schematic diagram showing the relation of vertical positions of SD pixels and HD pixels in interlace 
scanning; Fig. 14 is a schematic diagram showing an arrangement of SD pixels in model 1; Fig. 15 is a 
schematic diagram showing an arrangement of SD pixels in model 2; Fig. 16 is a schematic diagram 
showing an arrangement of SD pixels in model 3; Fig. 17 is a schematic diagram showing an arrangement 
of SD pixels in model 4; and Fig. 18 is a block diagram showing an apparatus for converting an image 
signal corresponding to predicted coefficients of each mode. 

Best Mode for Carrying Out the Invention 

Next, with reference to the accompanying drawings, preferred embodiments of the present invention will 
be described. Fig. 3 shows the construction of an embodiment of the present invention. Reference numeral 
11 is an input terminal to which an SD signal is supplied. The SD signal is for example a NTSC digital video 
signal with a predetermined sampling frequency. One pixel of the SD signal is composed of eight bits. The 
input SD signal is sent to a block segmenting circuit 12 that converts raster scanning sequence data into 
one-dimensional, two-dimensional, or three-dimensional block sequence data. 

The output of the block segmenting circuit 12 is sent to a data compressing circuit 13 and a predicted 
value generating circuit 16. The output of the data compressing circuit 13 is sent to a class code generating 
circuit 14. The output of the class code generating circuit 14 is an address signal and sent to a memory 15. 
The memory 15 stores predicted coefficients that have been learnt. The predicted coefficients stored in the 
memory 15 are supplied to a predicted value generating circuit 16. 

The data compressing circuit 13 and the class code generating circuit 14 categorize a considered pixel 
to be predicted as a class corresponding to a pattern of level distribution of each block containing the 
considered pixel and. The output of the class code generating circuit 14 is a class code that represents the 
determined class. Although the considered pixel is categorized as a class corresponding to the SD signal, if 
a plurality of pixels (eight bits of pixels adjacent to the considered pixel) of the SD signal are referenced 
(these pixels are referred to as SD pixels), the class number becomes large. To prevent the class number 
from becoming large, the data compressing circuit 13 is provided. The data compressing circuit 13 
compresses the number of bits of the adjacent pixels to be referenced (namely, referenced pixels). An 
example of the data compressing circuit 13 is an ADRC circuit. 
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To compress data, various techniques such as DCT (Discrete Cosine Transform), VQ (Vector Quan- 
tizing). DPCM (Differential Pulse Code Modulation), BCT (Block Truncation Coding), and non-linear quan- 
tizing may be selectively used as well as the ADRC (Adaptive Dynamic Range Coding) technique. 

As shown in Fig. 4, the block segmenting circuit 12 converts raster scanning sequence data of the TV 
signal into two-dimension block sequence data. In the example shown in Fig. 4, one block is composed of 
(3 x 3) SD pixels denoted by a to i. The predicted value generating circuit 16 generates the values of four 
HD pixels A to D nearly at the center of the block corresponding to the SD pixels of the block. 

Instead of the block shown in Fig. 4, the predicted value generating circuit 16 may generate predicted 
values of HD pixels corresponding to a one-dimensional block of four SD pixels a to d as shown in Fig. 5. 
Moreover, the present invention may be applied to a three-dimensional block that will be descried later. 

In the one-dimensional arrangement shown in Fig. 5, the HD pixel A is categorized as a class and a 
predicted value thereof is generated corresponding to the SD pixels a, b, and c. The HD pixel B is 
categorized as a class and a predicted value thereof is generated corresponding to the SD pixels a, b, c, 
and d. Likewise, these class categorizing process and a predicted value generating process apply to the 
learning operation. 

Next, the ADRC technique used for the data compressing circuit 13 will be described. In the ADRC 
technique, redundancy of level is adaptively removed corresponding to the correlation of local features of 
pixels. As shown in Fig. 6, it is clear that dynamic ranges A and B in each block that is re-quantized are 
much smaller than the original dynamic range of 0 to 255 of the original data composed of eight bits. Thus, 
the number of bits necessary for re-quantizing these blocks is much smaller than the eight bits. 

Now, assume that the number of assigned bits that is smaller than the number of original bits (eight 
bits) is p, the dynamic range of a block is DR, the value of a pixel in the block is x, and a re-quantized code 
is Q. With formula (1). the level between maximum value MAX and minimum value MIN is equally divided 
into 2 P portions and then re-quantized. Fig. 7(a) shows a re-quantizing operation of a level where p is 3. 

DR = MAX - MIN + 1 

Q = [(x - MIN + 0.5) x 2 p / DR] (1) 

where [z] represents the maximum integer that is z or less. 

Next, the mean value of pixels in the block having the data level equivalent to (2 P - 1) in the p-bit re- 
quantizing graduation level in Fig. 7(a) is calculated. The calculated result is a new maximum value MAX'. 
The mean value of the pixels in the block having the data level equivalent to the re-quantizing graduation 
level 0 is a new minimum value MIN'. With the new maximum value MAX' and the new minimum value 
MIN', the dynamic range is defined. With formula (2), the re-quantizing operation is performed. 

DR' = MAX' - MIN' 

q = [(x - MIN*) x (2 p - 1) / DR' + 0.5] (2) 

where [z] represents the maximum integer that is z or less. 

In the ADRC technique where the new maximum value MAX', the minimum value MIN', and the 
dynamic range DR' are re-defined, information amount can be effectively compressed. The compressed 
result is not affected by noise. 

As a quantizing operation corresponding to the ADRC technique, representative level being restored 
may have the same level as the maximum value MAX and the minimum value MIN. 

As an example of a two-dimensional block, corresponding to the above-described ADRC technique, 
values of n pixels where eight bits with values a to i have been compressed to p bits are sent to the class 
code generating circuit 14. Thus, a code class representing class c is generated corresponding to formula 
(3). 

class = l q i (2 p ) i ( 3 ) 



Next, wilh reference to Fig. 3. an ADRC ciicuit corresponding to the one-bit ADRC technique of will be 
described. In Fig. 8, a block sequence data is supplied lo an input terminal 21. The block sequence data is 
sent to a detecting circuit 22 that detects the maximum value MAX and the minimum value MIN of pixels 
for each block. The MAX and MIN are sent to a subtracting ciicuit 23. The output of the subtracting circuit 
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23 is a dynamic range DR. In addition, the input data and the MIN are sent to a subtracting circuit 24. Since 
the subtracting circuit 24 subtracts the MIN from the input data, the output thereof is normalized pixel data. 

The dynamic range DR is sent to a dividing circuit 25 that divides the normalized pixel data by the 
dynamic range DR. The output of the dividing circuit 25 is sent to a comparing circuit 26. The comparing 
circuit 26 determines whether or not each of the divided outputs of nine pixels is larger than 0.5. The output 
of the comparing circuit 26 is one-bit data DT whose value is "0" or "1". When the divided output is larger 
than 0.5, the value of data DT is "1". Otherwise, the value of data DT is "0". The data DT is sent to an 
output terminal 27. Corresponding to the one-bit ADRC technique, a block of (3 x 3) SD pixels can be 
categorized as a 9-bit class code. 

Returning to Fig. 3, predicted coefficients corresponding to the class code are read from the memory 
15. The predicted value generating circuit 16 generates a predicted value y' of an HD pixel corresponding 
to linear combination of SD data of each block supplied from the block segmenting circuit 12 and the 
predicted coefficients w, to w n . The linea combination is given by formula (4). 

y' = WtXi + w 2 x 2 + .... + w n x„ (4) 

In the example shown in Fig. 4, (n = 1,2, .... , 9). In the relation shown in Fig. 9, predetermined SD data 
are denoted by x1 to x9. 

In other words, when predicted values for four HD pixels A to D of one block are generated, SD pixels 
for the predicted coefficients are changed corresponding to the considered HD pixels. When the pixel A is a 
considered HD pixel the predicted value of the pixel A is generated corresponding to formula (5). 

y ' = Wl a + w 2 b + w 3 c + + w n i (5) 

When the pixel B is a considered HD pixel, the predicted value of the pixel B is generated corresponding to 
formula (6). 

y' = wia + w 2 d + w 3 c + .... + w n e (6) 

Thus, coefficients for generating predicted values of four HD pixels of the same block (namely, the same 
class) can be used in common. Thus, the storage capacity of the memory 15 can be reduced and the 
number of times of the access to the memory 15 can be decreased. The relation between the values xi to 
x 9 multiplied by the coefficients shown in Fig. 9 and the SD pixels a to i is defined corresponding to the 
distance between a considered HD pixel and SD pixels. 

X! to x 9 may be considered as nine bits of a class code formed by the one-bit ADRC technique. In 
other words, "the classes for the HD pixels A to D in the vicinity of the center of one block can be defined by 
changing the sequence of the values where the SD pixels a to i have been compressed and encoded. 
These values can be read from the memory by a different reading sequence. 

Fig. 10 is a flow chart showing an up-converting process where an SD signal is converted into an HD 
signal according to the present invention. At step 31 "START", the control of the up-converting process is 
started. At step 32 "SEGMENT DATA INTO BLOCK", an SD signal is supplied. At this step, SD pixels are 
extracted for each block as shown in Fig. 4. At step 33 "DATA END ?", if all input data has been 
completely processed, the flow advances to step 37 "END". Otherwise, the flow advances to step 34 
"DETERMINE CLASS". 

At step 34 "DETERMINE CLASS", a class is determined corresponding to a pattern of distribution of 
the signal level of the SD signal. For example, a class is determined corresponding to data compressed by 
for example one-bit ADRC technique. At step 35, predicted coefficients corresponding to the class code are 
read from the memory. At step 36 "CALCULATE PREDICTED VALUE", the calculation is performed 
corresponding to the formula (4) and the predicted value y' of the HD pixel is output. The sequence of 
these steps is repeated for all data. When all data has been completely processed, the flow advances from 
step 33 "DATA END" to step 37 "END". Thus, the up-converting process is completed. 

As described above, the memory 15 stores the predicted coefficients that have been learnt. Next, the 
learning process will be described. Fig. 1 1 is a block diagram showing a circuit block for performing the 
learning process according to an embodiment of the present invention. 

Reference numeral 41 is an input terminal into which a standard HD signal that is a large number of still 
images is supplied. The HD signal is sent to a filler 42 and a learning portion 44. The vertical thin-out filter 
42 thins out the HD signal in vertical direction so as to reduce the number of pixels of the HD signal in 
vertical direction by 1/2. The output of the vertical thin-out filter 42 is sent to a horizontal thin-out filter 43. 
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The horizontal thin-out filter 43 thins out the HD signal in horizontal direction so as to reduce the number of 
pixels of the HD signal in horizontal direction by 1/2. -Thus, an image signal with pixels same as an SD 
signal is sent to a learning portion 44. A predicted coefficient memory 45 stores predicted coefficients wl to 
wn at an address corresponding lo the class determined by a class categorizing circuit in the learning 
5 portion 44. 

The method for forming an SD signal from an HD signal is not limited to the above-described method 
where the thin-out filters are used. Instead, as in the pixel arrangement shown in Fig. 4, the value of an SD 
pixel a may be formed by the mean value of (2 x 2) pixels (for example A, B, C, and D). Alternatively, unlike 
with such a simple mean value, a weight mean value of HD pixels in a wide range (for example, HD pixels 
w in one block) may be used for an SD pixel. 

As in the arrangement shown in Fig. 4, when one block is composed of (3 x 3) SD pixels, the SD pixels 
a to i and the HD pixels A. B, C. and D become one set of learnt data. When there are many sets of learnt 
data for one frame and the number of frames is increased, a large number of sets of learnt data can be 
used. 

75 The learning portion 44 performs both a class categorizing process and a calculating process. As the 
class categorizing process, the learning portion 44 compresses the SD pixels a to i and categorizes the 
pattern of the two-dimensional distribution of the values of the compressed SD pixels as a class. As the 
calculating process, the learning portion 44 determines predicted coefficients of each class corresponding 
to the method of least squares. The class categorizing process performed by the learning portion 44 is the 

20 same as the process performed by the data compressing circuit 13 and the class code generating circuit 14 
shown in Fig. 3. This learning portion 44 may be accomplished by software. Fig. 12 is a flow chart showing 
the process of software for the learning portion 44. 

At step 51, the control of the learning portion 44 is started. At step 52 "SEGMENT CORRESPONDING 
DATA INTO BLOCK", an HD signal and an SD signal are supplied and a process for extracting the HD 

25 pixels (A to D) and the SD pixels (a to i) in the arrangement shown in Fig. 4 is performed. At step 53 "DATA 
END ?", if data process for all data (for example, one frame) has been completed, the flow advances to 
step 56 "DETERMINE PREDICTED COEFFICIENTS". Otherwise, the flow advances to step 54 "DETER- 
MINE CLASS". 

At step 54 "DETERMINE CLASS", a class is determined corresponding to the pattern of the level 

30 distribution of the SD pixels (a to i) in the vicinity of the HD pixels (A to D of Fig. 4) to be predicted. In this 
process, as described above, so as to reduce the number of bits, the SD pixels are compressed by for 
example the ADRC technique. At step 55 "GENERATE NORMAL EQUATIONS", formulas (12). (13), and 
(14), which will be descried later, are generated. 

At step 53 "DATA END ?", it is determined whether or not all data has been completely processed. 

35 When all data has been completely processed, the flow advances to step 56. At step 56 "DETERMINE 
PREDICTED COEFFICIENTS", formula (14), which will be described later, is solved by matrix solution so as 
to determine predicted coefficients. At step 57 "STORE PREDICTED COEFFICIENTS", the predicted 
coefficients are stored in the memory. At step 58 "END", the sequence of the steps of the process of the 
learning portion 44 is completed. 

40 Like the above-described signal converting apparatus, with the pixel arrangement shown in Fig. 4, the 
SD pixels a to i are compressed and encoded. The encoded values are categorized as a class. Predicted 
values of the HD pixels A to D are generated by linear combination of the values of the SD pixels a to i and 
the predicted coefficients wi to W n with a combination shown in Fig. 9. 

Next, the process for extracting predicted coefficients for HD pixels from the values of SD pixels will be 

45 described in detail. Assume that the values of SD pixels are Xi to x n and the true value of the considered 
HD pixel is y. For each class, a linear combination of n taps (coefficients wi to w n ) is given as follows. 

y" = W1X1 + w 2 x 2 + .... + w n x n (7) 

50 In formula (7), before the learning process is performed, Wj is an unknown coefficient. 

As described above, the learning process is performed for a plurality of HD data and a plurality of SD 
data for each class. When the number of data is m, formula (7) is modified to formula (8). 

yj* = WiXp + W 2 X j2 + + W n X jn (8) 

55 

where j = 1,2, .... , m. 

When m > n, since wi to w n are not unique values, elements of an error vector are defined as follows. 
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V 



ej = yj - (wix n + w 2 x i2 + .... + w n x ]n ) (9) 
where j = 1,2,... m. 

Coefficients thai minimize the following formula are obtained. 



70 



E 2 = S (ej) 2 



(10) 



In other words, the method of least squares is used. Partial differential coefficients are obtained with 
respect to w s of formula (10). 



75 



(11) 



20 



wi is obtained so that the value of the formula (11) becomes 0. Thus, assume 



25 



p=0 



pj 



(12) 



30 



W 

= 2 X 

j'O 



J 1 



(13) 



35 



The formulas (12) and (13) can be expressed with a matrix. 



40 



^11 ^12 



22 



*2n 



(14) 



^5 



50 



55 



Using conventional matrix solution such as sweeping-out method, the predicted coefficient w } can be 
solved. The predicted coefficient wi is stored in the memory with an address of a class code. 

Thus, the learning portion 44 can determine the predicted coefficient w; with the HD signal that is real 
data. The predicted coefficient Wj is stored in the memory. The predicted coefficients determined by the 
learning portion 44 are stored in the memory 15 shown in Fig. 3. 

The class categorizing process and the predicted value generating process of the above-described 
embodiment are performed corresponding to the values of SD pixels that are one-dimensionally or two- 
dimensionally arranged. Next, the class categorizing process corresponding to the values of SD pixels that 
are three-dimensionally arranged will be described. When SD pixels that are three-dimensionally arranged 
are used, chronological information can be used for the class categorizing process and the predicted value 
generating process. In addition, in the case of the interlace scanning TV signal, since line distance in a field 
is large, the predicting accuracy of the process of the field can be prevented from lowering. 
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In the conventional signal converting apparatus, to use moving information, the motion of an image is 
detected. Moving pixels are processed in a field, whereas still pixels are processed among fields. The 
output signal of each process is mixed with a coefficient corresponding to the amount of motion. Thus, in 
the conventional apparatus, a motion delecting circuit is additionally required. In addition, unless the motion 

5 is accurately delected, image deterioration tends to occur. The class categorizing process and the predicted 
value generating process using SD pixels that are three-dimensionally ar ranged can solve such problems. 

As described above, the class categorizing process and the predicted value generating process for the 
signal converting apparatus, which converts an SD signal into an HD signal, are the same as these 
processes for the learning portion. Next, the class categorizing process and the predicted value generating 

w process for SD pixels that are three-dimensionally arranged will be described. First, the relation of positions 
of lines of an HD signal and an SD signal will be explained. 

As with the learning process, when an HD signal is converted into an SD signal, pixels on three 
successive lines of the HD signal arc multiplied by non-symmetrical coefficients such as 1/8, 4/8, and 3/8 as 
shown in Fig. 13, one line of the SD signal is formed. These non-symmetrical coefficients are used in the 

75 reverse order for each field so as to maintain the interlace structure of the SD signal. In other words, if two 
lines of the HD signal are combined and one line of an SD image is formed, since the intervals of lines in 
the n-th field do not accord with the intervals of lines in the (n + 1)-th field, the interlace structure will be 
lost. 

For the class categorizing process and the predicted value generating process, for example 12 SD 
20 pixels that are three-dimensionally distributed are used. There are four patterns of 12 SD pixels correspond- 
ing to the position of one HD pixel to be predicted. These four patterns are referred to as mode 1, mode 2, 
mode 3, and mode 4. Figs. 14, 15, 16, and 17 show arrangements of pixels corresponding to mode 1, mode 
2, mode 3, and mode 4. respectively. In Figs. 14 to Fig. 17, circles represent HD pixels to be predicted and 
squares represent SD pixels to be used for class categorizing process and predicted value generating 
25 process. 

For the simplicity of hardware, in Fig. 15 and 17, SD pixels in the (n - 1)-th field and the (n + 1)-th field 
are partially averaged so as to form SD pixels in the n-th field. The SD pixels in the n field are denoted by 
squares with dotted lines. Figs. 14, 15, 16, and 17 each show a plurality of SD pixels and one HD pixel 
where the SD pixels are used for the class categorizing process and the predicted value generating process 
30 and the HD pixel is to be predicted. 

In mode 1 (arrangement shown in Fig. 14), six SD pixels in the n-th field, three SD pixels in the (n - 1)- 
th field, and three SD pixels in the (n + 1)-th field (a total of 12 SD pixels) are used to perform the class 
categorizing process and the predicted value generating process for an HD pixel. In mode 1, an HD pixel on 
the line where a coefficient of 1/8 is multiplied is predicted (see Fig. 13). 
35 In mode 2 (arrangement shown in Fig. 15), SD pixels on the same line in the (n - 1)-th field and the (n 
+ 1)-th field are averaged so as to interpolate three SD pixels that are required in the n-th field. The six SD 
pixels including the interpolated SD pixels in the n-th field, three SD pixels other than SD pixels used for 
averaging in the (n - 1)-th field, and three SD pixels other than SD pixels used for averaging in the (n + 1)- 
th field are used. Thus, a total of 12 SD pixels are used. In mode 2, an HD pixel on the line where a 
40 coefficient of 4/8 is multiplied is predicted (see Fig. 13). 

In mode 3 (arrangement of Fig. 16), six SD pixels in the n-th field, four SD pixels in the (n - 1)-th field, 
and two SD pixels in the (n + 1)-th field (a total of 12 SD pixels) are used. In mode 3, an HD pixel on the 
line where a coefficient of 1/8 is multiplied is predicted (see Fig. 13). 

In mode 4 (arrangement of Fig. 17), SD pixels in the (n - 1)-th field and (n + 1)-th field are averaged so 
45 as to interpolate four SD pixels (two pixels each on two lines) that are required in the n-th field. The eight 
SD pixels including the interpolated SD pixels in the n-th field, two SD pixels other than the SD pixels used 
for averaging in the m - 1)-th field, and two SD pixels other than the SD pixels used for averaging in the (n 
+ l)-th field are used. Thus, a total of 12 SD pixels are used. In mode 4, an HD pixel on the line where a 
coefficient of 4/8 is multiplied is predicted (see Fig. 13). 
50 As described above, in modes 3 and mode 4, an HD pixel is generated at the 1/2 position of the 
sampling intervals of SD pixels. In other words, in mode 1 to mode 4. an HD pixel with horizontal and 
vertical resolutions that are twice those of an SD pixel is generated. 

In the learning process, predicted coefficients of each of mode I to mode 4 are determined by the 
method of least squares for each class and stored in the memory. Fig. 18 shows a construction of a signal 
55 converting apparatus that generates an output HD image corresponding to an input SD image. 

A signal dO that is an SD image in the (n + 1)-th field, is supplied lo an input terminal 61. The dO signal 
is sent to a field memory 62 and up-converting circuits 64a to 64d. The field memory 62 outputs an SD 
image signal in n-th field. This signal is denoted by signal d1. The signal d1 is sent to the up-converting 
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circuits 64a to 64d. The field memory 63 outputs an SD image signal in the (n - 1)-th field. This signal is 

referred to as sinal 62. The signal d2 is sent to the up-converting circuits 64a to 64d. 

An HD image is formed corresponding to signals dO, dl, and d2 of the SD image sent to the up- 

converling circuits 64a to 64d. The up-converting circuits 64a to 64d perform signal conversions of the 
5 above-described mode 1 to mode 4. A selector 65 is controlled with a select signal received from an input 

terminal 67. The select signal is a signal that designates a mode of a considered pixel. The output of the 

selected up-converting circuit is sent to an output terminal 66. 

The up-converting circuits 64a to 64d in Fig. 18 has the same construction except that predicted 

coefficients stored therein differ from each other. More practically, except that SD signals in three 
10 chronologically successive fields are supplied to a block segmenting circuit and a three-dimensional block 

is formed, the construction of the up-converting circuits 64a to 64d is the same as the construction of the 

circuit block shown in Fig. 3. 

In the above-described embodiment, by learning the relation between a known HD signal and an SD 

signal formed thereof, predicted coefficients for each class are determined. With the predicted coefficients 
75 and an input SD signal, an HD signal is generated. Thus, the resolution of the HD signal is compensated. In 

addition, since a class is adaptively selected corresponding to the distribution of the level of the SD signal, 

the SD signal can be up-converted corresponding to local characteristics of the image. 

Moreover, a class is determined corresponding to the three dimensional (chronological) distribution of 

the level of an input SD signal. The SD signal is converted corresponding to predicted coefficients that have 
20 been determined and learnt for each class. Thus, since chronological information can be effectively used, a 

converting image signal can be accurately output for a moving image. Furthermore, since the chronological 

information can be effectively used, an interlace signal can be effectively converted. 

In the above-described embodiment, the same SD pixels are used for both the class categorizing 

process and the predicted value generating process. However, it is not necessary to do so. For example, 
25 part of SD pixels used for the predicted value generating process may be used for the class categorizing 

process. 

When an HD signal is predicted, besides linear combination of a plurality of SD pixels and predicted 
coefficients, the values of pixels of the HD signal or normalized values thereof (referred to as representative 
values) may be pre-determined by the learning process and these values may be stored in memory. 

30 Next, a second embodiment of the present invention will be described. In this embodiment, normalized 

values of pixels of an HD signal are used. As with the first embodiment of the present invention, a class is 
determined corresponding to a pattern of a first-dimensional distribution, second-dimensional distribution, or 
three-dimensional distribution of SD pixels. Predicted values of each class are learnt and determined. For 
example, in the case of SD pixels that are two-dimensionally arranged shown in Fig. 4, SD pixels that are 

35 used for the class categorizing process for HD pixels A to D of a block are selected in the relation shown in 
Fig. 9. 

As an example, now assume that one of HD pixels A to D that are two-dimensionally distributed is 
considered as shown in Fig. 4. In addition, assume that the cumulated value of learnt data of class c at the 
present time is SU(c) and the number of times of cumulation is n(c). The arithmetic operations given by 
40 formula (15) are repeatedly performed for the learnt data. 

SU(c) = (hd - base) / DR 
n (c) = n(c) + 1 (15) 

45 where hd is the true value of an HD pixel of the learnt data, base is a base value of the block, and DR is a 
dynamic range of the block. The base value may be selected from the minimum value of the SD pixels in 
the block, the value of the SD pixel that is most close to the considered HD pixel, the value of the SD pixel 
at a predetermined position in the block, the simple mean value (weighted mean value) of the values of the 
SD pixels in the block, or the like. The value of center of gravity g (normalized value) is obtained by formula 

50 (16). 

g(c) = SU(c) / n(c) (16) 

The value of center of gravity g(c) may be directly obtained by formula (17). 

55 

n(c) = n(c) + 1 

g(c) = {(hd - base) / DR + (n(c) - 1) x g<c)} / n(c) (17) 



10 



NiSGCCO <E = 



GS35973A-, i > 



EP 0 635 978 A1 



The formula (16) represents the process for successively storing the values of center of gravity 
obtained in the memory. On the other hand, the formula (17) represents the process for storing the 
cumulated value of the former values of center of gravity in the memory and for dividing the total of the 
cumulated value and the present value of center of gravity by the number of times of cumulation. As 
described above, formulas (16) and (17) represent the process for cumulating values normalized with the 
base value base and the dynamic range DR rather than the true value hd of the HD pixel. When the 
cumulated value for each class is stored in the memory and the cumulated value is divided by the number 
of times of cumulation, representative values may be obtained. However, since the cumulated value 
becomes large, the number of bits of the memory may increase. 

The value of center of gravity that has been determined and learnt is used for the signal converting 
process. In other words, as with the first embodiment, a class code representing a class is sent to the 
memory. The class code is stored in the memory as an address. The value of center of gravity g(c) of the 
class is read from the memory. With the value of center of- gravity g(c), predicted value hd* is generated 
corresponding to formula (18). 

hd' = DR x g(c) + base (18) 

In the first and second embodiments of the present invention, when blocks with dynamic range DR that 
is less than a predetermined value are excluded from data to be learnt, the learning process can be 
prevented from being affected by noise. 

In addition, according to these embodiments, predicted coefficients and representative values are pre- 
learnt. However, in a high speed system, the predicted coefficients and representative values can be 
obtained on real time basis. The predicted coefficients and representative values that have been learnt can 
be updated with those obtained on the real time basis. 

Either the method for predicting the values of all pixels of the output image signal with high resolution 
or the method for predicating the values of pixels that are not present may be selected. 

Moreover, the present invention may be applied to image signals other than interlace scanning TV 
signals, sequence scanning TV signals. 

Claims 

1. A signal converting apparatus for converting a first digital image signal into a second digital image 
signal having higher resolution than the first digital image signal, comprising: 

class categorizing means for categorizing a considered pixel of the second digital image signal as 
a class corresponding to a pattern of level distribution of a plurality of reference pixels of the first digital 
image signal, the reference pixels of the first digital image signal being disposed spatially and/or 
chronologically adjacent to the considered pixel; 

memory means for storing predicted coefficients for each class and for outputting the predicted 
coefficients corresponding to class information received from said class categorizing means; and 

predicted value generating means for calculating linear combination of the values of the pixels of 
the first digital image signal spatially and/or chronologically adjacent to the considered pixel so as to 
generate a predicted value of the considered pixel. 

2. The signal converting apparatus as set forth in claim 1 , further comprising a learning device for pre- 
determining the predicted coefficients, 

wherein said learning device comprises: 

means for forming the first digital image signal with the second digital image signal; 

class categorizing means for categorizing a considered pixel of the second digital image signal as 
a class corresponding to a pattern of level distribution of a plurality of reference pixels of the first digital 
image signal, the reference pixels of the first digital image signal being disposed spatially and/or 
chronologically adjacent to the considered pixel; and 

means for determining predicted coefficients for each class so that the sum of square of an error 
between the irue value of the considered pixel and the predicted value becomes minimum when the 
predicted value of the considered pixel is generated by linear combination of a plurality of pixels of the 
first digital image signal and the piedicled coefficients. 

3. A signal converting apparatus for converting a first digital image signal into a second digital image 
signal having higher resolution than the first digital image signal, comprising: 
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class categorizing means for categorizing a considered pixel of the second digital image signal as 
a class corresponding to a pattern of level distribution of a plurality of reference pixels of the first digital 
image signal, the reference pixels of the first digital image signal being disposed spatially and/or 
chronologically adjacent lo the considered pixel; 

memory means for storing representative values for each class and for outputting the representa- 
tive vlues corresponding to class information received from said class categorizing means; and 

predicted value generating means for generating the representative values received from said 
memory means as a predicted value of the considered pixel. 

4. The signal converting apparatus as set forth in claim 3, wherein a base value of a block composed of a 
plurality of pixels of the first digital image signal and a value normalized by a dynamic range is used as 
representative values. 

5. The signal converting apparatus as set forth in claim 3, further comprising a learning device for pre- 
determining the representative values, 

wherein said learning device comprises: 

means for forming the first digital image signal with the second digital image signal; 

class categorizing means for categorizing a considered pixel of the second digital image signal as 
a class corresponding to a pattern of level distribution of a plurality of reference pixels of the first digital 
image signal, the reference pixels of the first digital image signal being disposed spatially and/or 
chronologically adjacent to the considered pixel; and 

means for determining representative values of the second digital image signal for each class by 
mean value calculating process. 

6. The signal converting apparatus as set forth in claim 1 or 3, wherein said class categorizing means is 
adapted to categorizing the considered pixel of the second digital image signal as a class correspond- 
ing to a pattern of a one-dimensional level distribution of a plurality of reference pixels of the first digital 
image signal. 

7. The signal converting apparatus as set forth in claim 1 or 3, wherein said class categorizing means is 
adapted to categorizing the considered pixel of the second digital image signal as a class correspond- 
ing to a pattern of a two-dimensional level distribution of a plurality of reference pixels of the first digital 
image signal. 

8. The signal converting apparatus as set forth in claim 1 or 3, wherein said class categorizing means is 
adapted to categorizing the considered pixel of the second digital image signal as a class correspond- 
ing to a pattern of a three-dimensional level distribution of a plurality of reference pixels of the first 
digital image signal. 

9. The signal converting apparatus as set forth in claim 1 or 3, wherein said class categorizing means is 
adapted to compress and encode the values of a plurality of pixels of the first digital image signal and 
categorize the considered pixel of the second digital image signal as a class corresponding to the 
encoded values. 

10. The signal converting apparatus as set forth in claim 1 or 3, wherein the predicted coefficients or 
representative values of pixels adjacent to the second digital image signal are used in common so as to 
save the storage capacity of said memory means. 

11. The signal converting apparatus as set forth in claim 1 or 3. wherein the first and second digital image 
signals are interlace scanning sequence TV signals. 
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Fig. 4 
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Fig. 6 



2 5 5 



1 9 1 



1 2 7 




ONE BLOCK 



ONE BLOCK 



Fig. 7 



(a) 
MAX 



MEAN 
VALUE 



3 
2 
1 

0 



MEAN 
VALUE 



(b) 
MAX' 



7 
6 
5 
4 
3 
2 
1 
0 



M I N' 



M I N 



-DOCiD -H= J-335979A1 I > 



15 



EP 0 635 978 A1 



Fig. 8 
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Fig. 10 
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Fig. 12 
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Fig. 13 
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Fig. 14 
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Fig. 16 
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Fig. 18 
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